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Abstract —In cloud computing, users scale their resources 
(computational) based on their need. There is massive literature 
dealing with such resource scaling algorithms. These works 
ignore a fundamental constrain imposed by all Cloud Service 
Providers (CSP), i.e. one has to pay for a fixed minimum 
duration irrespective of their usage. Such quantization in billing 
cycles poses problem for users with sporadic workload. In recent 
literature, Cloud Broker (CB) has been Introduced for the benefit 
of such users. A CB rents resources from CSP and in turn 
provides service to users to generate profit. Contract between 
CB and user is that of pay-what-you-use/pay-per-use. However 
CB faces the challenge of Quantized Billing Cycles as it negotiates 
with CSP. We design two algorithms, one fully online and the 
other partially online, which maximizes the profit of the CB. 
The key idea is to regulate users demand using dynamic pricing. 
Our algorithm is inspired by the Ski-Rental problem. We derive 
competitive ratio of these algorithms and also conduct simulations 
using real world traces to prove the efficiency of our algorithm. 

I. INTRODUCTION 

A. Overview 

There is no universal definition of cloud computing. How¬ 
ever as far as our research is concerned, the most apt def¬ 
inition of cloud computing is found in III which can be 
quoted as: ’’computing as a utility”. In our day to day life 
the most common utilities are electricity, water, gas, heat, 
postpaid mobile services etc. Similarly in cloud computing, 
computing resources (like CPU, memory, storage, network 
domains, virtual desktop) are rented to users based on their 
demand. From user’s viewpoint, it eliminates the need of an 
upfront investment as an user can pay based on the amount of 
resources it has used. This is termed as “pay-per-use” or “pay- 
as-you-go” model. Therefore, resource scaling is the most 
fundamental aspect of cloud computing and hence extensive 
amount of effort has been channeled to explore this area. 
Cloud Service Providers (or CSP’s), like Amazon, ElasticHost 
etc, rent computing resource to the users in form of Virtual 
Machines (also called instances) or VMs. Scaling of VMs 
revolves around two fundamental questions: 

1. From Users Perspective: How to scale VMs to optimize 
a certain objective? 

2. From CSP’s Perspective: How to support the active VMs 
using minimum number of physical servers? 

In cloud computing literature the former is often called auto 
scaling while the latter is called dynamic provisioning. These 
two questions are indeed similar and rely on a common line of 
research. Various researchers approached these two problems 
with different objectives and by using different mathematical 
tools. In the following we will give a brief account of these 
approaches. 


Concerning auto scaling, the most simplest methods are 
the Static Threshold based policies where the scaling of 
VMs is triggered when certain CPU parameters (CPU load, 
response time) crosses a pre-defined threshold. Such methods 
are too simple and if not designed properly face the problem 
of limit-cycles. Queuing theory based techniques are also very 
well developed. Some well cited papers in this regard can be 
0, 0. Theoretic results available in queuing theory uses 
simplified models and hence cannot be directly applied to 
complex systems like those encountered in cloud computing 
framework. Also the results are statistical implying that the 
analysis is true only over a long run after the system reaches 
steady-state. Control Theory based approaches aim to solve 
the latter problem by trying to control the transients a, 0. 
They are generally reactive methods, and hence are not good 
for workloads which have a lot of sudden spikes. There is 
a lot of active research concerning proactive methods. They 
depend on predictive abilities and use tools like Model Pre¬ 
dictive Controller Q, Time Series Analysis 0, 0, Machine 
Learning ifTOll . Work done in 0 is impressive in the sense that 
the prediction algorithm has low overhead and also has the 
ability to distinguish fine workload patterns. There are several 
tasks which are deadline sensitive. There is a separate line 
of research based on heuristic algorithms CD, oa to handle 
these tasks. 

Similar tools have been used to optimize performance of 
data centers. Queuing theory has been widely used. ca 
is a good reference in this regard as it uses much less 
approximation to model a cloud data center. Control Theoretic 
approaches are very famous. One popular book dealing with 
this subject is iH. ca is as well cited paper which presents 
practically viable method to regulate a performance metric of 
data center using feedback control. A very interesting method 
to save energy of a cloud data center has been presented 
in ca where they use feedback control to regulate CPU 
clock frequency based on incoming request rate. Optimization 
based techniques have been directly used to design resource 
allocation strategies to maximize profit of cloud data centers 
CD, ca. Another line of research considers optimizing the 
electricity and the cooling cost required to run a data center. 
Such problems are not trivial when electricity ca and the 
water cost 1201 are time varying. 

B. Quantized Billing Cycles and the Cloud Broker 

There is a fundamental misconception regarding resource 
scaling in cloud computing literature. Definitely, the main 
objective of cloud computing is elasticity, i.e. the user can 
scale-up or scale-down VMs based their short term needs. 



Fig. 1. A part of the google cluster usage traces showing spikes in user 
demand. Red circles indicates the spikes. 

However this “short term” is not infinitely small in the sense 
that it may not be possible to return the VMs in the very next 
instant after it was rented. 

We will explain this situation using an example. Consider 
that we are using on-demand instances of Amazon EC2 
to satisfy these demands. Billing Cycle of AmazorLEC2 on 
demand instance is 1 hour, i.e. you have to pay the same 
price if you use the VM for 1 min or 1 hour. We call 
this phenomenon as Quantized Billing Cycles (abbreviated as 
QBC) in the rest of the paper. This leads to serious problem 
especially for those users whose demand pattern is sporadic in 
nature. For e.g. Consider the workload trace shown in Figure 

which shows the number of VMs required to satisfy user 
demand. There are few spikes in the demand curve to satisfy 
which one needs to buy extra VMs. These VMs may be in use 
for a very small fraction of 1 hour after which it may be idle. 
However we cannot return these VMs before 1 houf] Hence 
there will be wastage of VMs. 

To mitigate this problem to some extent the concept of cloud 
broker has been proposed in recent literature GI], 1221, ED. 
A broker forms a middle man between the CSP and the general 
users as shown in Figure]^ The users send their job requests to 
the cloud broker. The cloud broker rents VMs from the Cloud 
Service Provider to service these demands. The cloud broker 
charges the user based on the fraction of VMs resources used 
to service the job request. This is called pay-what-you-use. 
It can also be based on per-request-basis. This transfers the 
challenge of QBC from the users to the cloud broker. However 
this issue is not as critical for the cloud broker as it is for the 
users. This can be understood as follows: 

1. QBC poses problems for those users whose demand is 
sporadic. Higher the sporadic nature, greater the loss. 

2. Cloud broker serves the aggregate demand of many users. 
In statistical sense, the sporadic/spiky nature in aggre¬ 
gated demand should be lesser compared to individual 
user’s demand. This is because a summer is a discrete 
integrator, a low pass filter which removes the noises. 

In the rest of the paper we will concentrate on how a cloud 

'The user may choose to return the VM but it won’t get any financial 
benefit. Therefore it is better to keep the VM for the entire billing cycle even 
though it remains idle. This is called smart-kill. 
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Fig. 2. Schematic showing cloud broker as a middleman between Cloud 
Service Provider (CSP) and the users. Adapted from (23). 

broker can maximize its profit under QBC. 

Remark: We would like to stress that the mathematical 
formulation, algorithms and analysis presented here can be 
applied to a single user without any change. However given 
that a cloud broker has a smoother workload pattern compared 
to a single user, a broker will benefit more compared to a user. 

C. Existing literature and our contribution 

In our problem, every time a cloud broker has to buy a VM 
it faces the risk of under utilization of the VM in the subse¬ 
quent time slots. The broker has to make a decision without 
knowledge of future demand. Study of such problems comes 
under the category of Online Algorithms, more specifically 
the Ski-Rental problems. There have been a few applications 
of Ski-Rental literature to solve real world problems. In this 
section we will discuss the ones pertaining to cloud computing. 
These works has close resemblance with our problem. 

We derive our motivation from the work done in ll24l . ll^ . 
In these two works the cloud broker has to decide in each time 
slot whether to reserve instances or to serve the demand using 
on-demand instances. If the demand persists for a long time 
then reserving is a better option. However if the demand falls 
down quickly it is better to use on-demand instances. Cloud 
broker has to make this decision without any knowledge of 
future demand or with partial knowledge of future demand. 
The authors designed both deterministic and randomized on¬ 
line algorithms and also derived their competitive ratio. ina 
and ESi considers the same problem however the model used 
in ll25l is more general. It tries to optimize the switching cost 
and electricity cost of a data center. If there is a decrease in 
demand then the data center has to decide whether to shut 
down few physical servers or let them run. To run a physical 
server one needs to bear the electricity bill while switching 
down the server incurs wear-and-tear cost. Wear-and-tear cost 
is higher than electricity bill in short run but smaller in the long 
run. If one decides to switch off the server and immediately 
later it is needed, then one saves a small electricity bill in 
the expense of suffering a large switching cost. Therefore 
taking such decisions without knowledge of future demand is 
difficult. In ll25l . author designed only deterministic algorithms 
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to tackle this problem while ll2^ considers both deterministic 
and randomized algorithms. A recent work presented in ll27ll 
shows that the competitive ratio of the randomized algorithms 
can be improved if statistical information (like first and second 
order moment) of the demand process is available. 

To the best of our knowledge the references given above 
constitutes almost all the major work dealing with the use 
of ski-rental framework for cloud computing applications. We 
make two main contributions to this literature; 

1. We suggest how dynamic pricing can be used to maxi¬ 
mize the profit of the cloud broker under QBC. This can 
be used as an alternative to the work done in 1^ . ll^ 
specially in the case where the workload is deferrable. 
One can also use this approach alongside ll24l . ll^ . 
These two points will be discussed further in Section [V] 

2. We show that the knowledge of future demand leads to 
better competitive ratio for the Partial Online algorithm. 
In ll26ll such attempts have been made but with a major as¬ 
sumption on the demand graph, i.e. the demand increases 
or decreases no more than one step in every time slot. 

II. Problem Formulation 
A. Motivating Example 

We aim to use dynamic pricing as a control signal to 
regulate user demand. The motivation to use dynamic pricing 
in cloud computing setup is derived from the paper 128]. 
Before proceeding forward with the quantitative formulation 
of the problem, we will first consider an example to illustrate 
our idea. Consider the following scenario; 

1. Cloud broker buys VMs from Amazon at $0,132 per VM. 
This is the cost price of the cloud broker. The billing cycle 
of each VM is 1 hour = 60 min. 

2. Duration of a time slot = 10 min. Hence a VM is active 
for 6 time slots. 

3. Nominal selling rate = $0.03 per VM per time slot. 
Nominal selling rate should be such that if the VM is 
in use for all the time it is active (6 time slots here) 
then the cloud broker should make a profit. In our case; 
$0.03 X 6 = $0.18 > $0,132. Hence the condition holds. 

4. The cloud broker has the freedom of changing the selling 
rat^at the beginning of each 10 min interval. The users 
remains totally aware of the current selling rate. 

We will investigate two cases, one with static pricing and 
the other with dynamic pricing. Relevant graphs are shown in 
Figure In both the cases we start with the assumption that 
at time t = 0 the cloud broker has no VMs. 

Case 1 (Static Pricing): 

In static pricing, the selling price remains constant at 
nominal rate therefore. Demand = Actual Demand. In the 1®* 
interval the demand is of 2 VMs while we have 0 VMs. So 
we buy 2 VMs incurring a cost of 2 x $0,132 = $0,264. 
In the interval, demand is of 10 VMs while we have 
2 VMs (bought in the 1®‘ interval). So we buy 8 VM 

^In the work the term “VM price” inherently means the selling price of the 
VM not the cost price. 



Fig. 3. An example of static and dynamic pricing: a) Graph showing Price 
vs Time for static and dynamic pricing, b) The graph coiTesponding to actual 
demand (for static pricing) and modified demand (for dynamic pricing), c) 
Graph showing actual/modified demand along with the active number of VMs 
for both static and dynamic pricing case. 

incurring a cost of 8 x $0,132 = $1,056. In the next 4 
intervals the demand is less than 10 VM and hence we don’t 
have to buy any more VMs. Therefore the net cost price 
of the cloud broker is $1.32. The net selling price of the 
cloud broker is (2 -f 10 -I- 4 -f 3 -I- 8 -f 4) x $0.03 = $0.93. 
Profit = Selling Price-Cost Price = $0.93-$1.32 = -$0.39, 
i.e. the cloud broker suffered a loss. 

Case 2 (Dynamic Pricing): 

In this case the selling price varies in response to which 
the user’s demand gets modified. This case is similar to 
Case 1 except that in the 2"'^ and the 5*^ interval the 
selling price goes up to $0,045 and $0,038 respectively. The 
cloud broker has to buy 2 VMs and 4 VMs in the 1®* 
interval and 2"^^ interval respectively incurring a cost price of 
6 X $0,132 = $0,792. The net selling price of the cloud broker 
is (2 -f 4 -f 3 -f 4) X $0.03 -f 6 X $0,045 -f 6 x 0.038 = $0,888. 
Profit = SellingPrice-CostPrice = $0.888-$0.792 = $0,096, 
i.e. the cloud broker makes a profit. 

We will encapsulate the idea behind dynamic pricing by 
making the following key observations; 

1. The idea of increasing the selling price is to decrease the 
demand and not to increase the revenue. To understand 
this point consider the 2"^^ interval. The actual demand 
was 10 VMs which could have lead to a revenue of $0.3 
if the rate was nominal. As the selling price increased 
to $0,045 the demand reduced to 6 VMs leading to an 
revenue of $0.27. Therefore the net revenue in the 2"'^ 
interval decreased even though the selling price increased. 
Similar argument is true for the 5*^ interval. Therefore 
whenever there is an increase in price there is a decrease 




















































Fig. 4. Cartoonistic representation of Cloud Brokerage mechanism. 


in revenue (and hence in profit) in that interval. Pictorially 
speaking, the revenue loss suffered in dynamic pricing 
can be captured by the area between the solid black curve 
and the dashed blue curve in Figure [3f- 

2. Yet dynamic pricing makes more profit than static pricing. 
This is because in static pricing, many VMs are underuti¬ 
lized and hence does not contribute to the revenue. This 
is clearly shown in Figure [^. Underutilized VMs for 
static case corresponds to the area between the solid black 
curve and the dashed red curve while that in dynamic 
case corresponds to the area between the dashed green 
curve and the dashed blue curve. Definitely the area 
corresponding to static case is more. 

3. The idea behind dynamic pricing is: “Suffer a small loss 
in one interval by decreasing the demand (Refered as 
Demand Loss later) rather than buying a VM and then 
suffering a major loss in the subsequent intervals due to 
low demand (Refered as VM Loss later)”. 

B. Quantitative Modeling 

Cloud brokerage mechanism basically consist of two blocks: 
1) Resource Scheduler 2) Resource Scaler. This is shown in 
Figure Both resource scheduler and resource scaler are 
controlled by the cloud broker. The job of the Resource 
Scheduler is to schedule the incoming tasks onto the available 
VMs. While doing so it has to consider service level agreement 
(or SLAs). Resource Scheduler can be designed using well 
established theoretic tools as discussed in Section II-AI The 
role of the resource scaler is to rent (scale-up) VMs from 
CSP and also to perform dynamic pricing in order to maximize 
the profit of the cloud broker. In this paper, we assume that 
resource scheduler is already designed and concentrate on 
designing algorithms for Resource Scaler. Note that the design 
of Resource Scheduler and Resource Scaler can be decoupled. 
The Resource Scheduler should just update the Resource 
Scaler regarding the number of VMs required to complete the 
tasks in a given time slot. Before moving forward, please note 


the scale-down valve which is controlled by CSP. As shown 
in Figure this is not in control of the cloud broker. This 
aptly captures the notion of QBC and the process of smart- 
kill. As mentioned in Section [UBj due to the presence of QBC 
there is no point of giving back a VM before its billing cycle 
ends. This will be equivalent to the CSP automatically scaling 
down the VMs after its billing cycle. Therefore from cloud 
broker’s perspective: “Only scaling-up of VMs is controllable 
but scaling-down is not”. 

We will now pose our problem mathematically. We consider 
that the user pays the cloud broker based on per-request/pay- 
what-you-use basis. In such a scenario the resource scaler has 
to solve the following profit optimization problem: 

T 

OPl : max U = V] {jtdt - vQ 

{luvt} 

t 

subject to : > d* ; Vf = 1, 2,..., T 

dt = fidljt) ;Vf = l,2,...,r 

In optimization problem OPl, P is the profit to be maxi¬ 
mized. The term {jtdt — Vt) is the profit at interval where, 
7 t is the selling price per VM per time slot, dt is the number 
of VMs required to service the incoming job request and vt is 
the number of VMs bought at interval. Without any loss 
of generality we normalize the cost price of a VM to 1 unit. 

t 

T is the period of the billing cycle and hence '^he 

2—£—r+1 

number of active VMs in the interval, dt is the modified 
VM demand when the selling price is 74 . The relation between 
the actual demand df and the modified demand dt is captured 
by the price-demand function /(■). If 7 t = 7 *, the nominal 
price, then dt = d^. The revenue earned by selling a VM at 
the nominal price of 7 * for one complete billing cycle is 7 *r. 
For the cloud broker to make profit, 7 *r > 1, 1 being the 
cost price of a VM. It should be noted that all the variables 
associated with OPl lies in the set K_|_. 

In conventional sense, the optimization problems dealt in 
ski-rental framework are minimization problems. Therefore we 
are interested in formulating OPl as an equivalent minimiza¬ 
tion problem. In this regard note that 

T 

P = ^ i'ytdt - Vt) 

T 

T 

= '^b*d*t-{{l*d*t-ltf{d*t,-ft))+vt}] (1) 

i=l 

In equation (0 the first term 7 *d( is not controllable and 
hence the maximization of P becomes equivalent to mini- 

T 

mization of ^ [{'l*dl — "ft f (dl, "ft)) + I’t]- We thus pose the 

t=i 

following optimization problem which is equivalent to OPl: 
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Demand Loss 

OP2 

min L = y 

{it,vA ^ 

h*d*t - ytdt) + 

VM Loss 

subject to 

Vi>dt 

;Vt = l,2,...,r 





dt = / (dt, yt) 

;Vf = l,2,...,T 


Intuitively speaking, OP2 does the following: Consider that 
there is a hike in demand which decays soon. In such a case 
OP2 will increase the selling price to reduce the demand. 
In this way the cloud broker will suffer a small “Demand 
Loss”. Buying enough VMs to support the demand hike is not 
a good option in such scenario as the cloud broker may suffer 
a huge “VM Loss” in subsequent intervals due to underutilized 
VMs. But if the hike in demand persists for a long time it is 
better to buy VMs to support this hike. However OP2 is an 
offline optimization problem, i.e. to solve OP2 we need 
for all f = 1, 2,..., T. It is not possible to know in advance 
if an increase in demand is going to persist or will decay 
soon. The challenge is to design algorithms which can make 
such decisions online based on present and past data. Such 
algorithms are called online algorithms. 

We now define the concept of Competitive Ratio which will 
be used later in Section III to compare the performance of the 
online algorithm with its optimal offline counterpart. Say that 
an online algorithm A and the optimal algorithm OPT (OP2 
here) suffers a loss LA{d*) and Lqpt (d*) respectively for 
a given demand sequence d* = [di ' '^t] ■ Then 

algorithm A is called c—competitive if: 


LA{d*) <c-Loprid*) Vd* e ( 2 ) 

Indeed c > 1. In inequality (|^ we have slightly misused 
the notation. is a T dimensional vector of non negative 
real numbers not transpose of a non negative real number. 

C. Properties and Assumptions 


Properties of Demand and Revenue Function 



Fig. 5. Graphs showing typical: a) Demand Functions b) Revenue Functions. 
Two demand functions and the corresponding revenue functions are shown. 
For the Semi-Infinite case [ 7 * , 00 ) is the operating zone while for Finite 
case [ 7 * , 7 op) is the operating zone. 


in [0,7*], it is obvious that 7/(d*,7) will linearly 
increase in this range. Revenue function is maximum at 
7 = 7 * and hence this is set as the nominal price. 

Let d = f {d* , 7 ). Then according to property 4 we have 

- oo < ^ {yd) < 0 (3) 

Differentiating inequality (|^ using chain rule we get 

, dd ^ dd d 

— oo < d + 7 — < 0 —00 < 7 ^ <- (4) 

07 ay 7 

A demand function satisfying inequality Q will be strictly 
monotonically decreasing if d > 0. Two cases may arise: 

Case 1 (Semi-Infinite Operating Zone): d > 0 for all 
7 G [ 7 * , (X)) and hence the demand function is strictly 
monotonically decreasing in this range. 

Case 2 (Finite Operating Zone): d > 0 for all 7 € 
[ 7 * , 7op)- For 7 = 7 op, d = 0. Due to property 7, d = 0 
for all 7 > 7 op. Therefore the demand function is strictly 
monotonically decreasing in the range [ 7 * , yop)- 

The range of 7 where f {d* , 7 ) is strictly monotonically 
decreasing is called the operating zone. Figure illustrates 
the concept of operating zone for the above two cases. Strict 
monotonic nature of / (d* , 7 ) in the operating zone implies 
that the function is invertible in this range. Mathematically, 
the following function exist in the operating zone. 


Most of the price demand function / (d*, 7 ) (also called de¬ 
mand function) found in real world must satisfy the following 
conditions: 

1 . /(d*, 7 )> 0 . 

2 . / (d*, 7 ) > 0 is monotonically increasing in d*. 

3. f {d* , 7 ) is monotonically decreasing in 7 in the range 
[ 7 *, 00 ) while it is constanj^at d* in the range [ 0 , 7 *]. 

4. The revenue function yf{d*, 7 ), is monotonically de¬ 
creasing in 7 in the range [ 7 *, 00 ). This captures the idea 
that increasing the selling price is to decrease the demand 
and not to increase the revenue. As / (d*, 7 ) is constant 

^We assume that the demand cannot increase above the actual demand d* 
even if the price decreases below 7 * 


l = g{d*,d) (5) 

The function g {■) returns the imposed price 7 * given the 
actual demand dj and the modified demand dt- 

Remark: The role of dynamic pricing is to regulate the 
demand. In the range [0, 7 *], price has no effect on the demand 
and yet the cloud broker will incur a demand loss. Therefore 
to minimize OP2 we work in the operating zone. 

Assumptions in Problem Formulation 

1. We ignore the effect of reputation while formulating our 
optimization problem. By increasing the price we force 
some tasks to exit the queue. By doing this we earn 


















negative reputation of the users. A static demand function 
of the form dt = f (dj, 7t) does not capture the effect 
of pricing history and hence the role of reputation. 

2. The partial online algorithm, discussed later in Section 
m relies on demand prediction for future window w. 
In reality, only an estimate of future demand is possible 
however we assume perfect knowledge of future demand. 

3. In real-life scenario, the relation between the modified 
demand dt and the actual demand d^ is not governed 
by a deterministic function / (•)■ Rather the demand dt 
has a probability distributiorn in the range [0, d^] for 
a given -ft- However many works dealing with social 
welfare maximization using dynamic pricing (like 1 ^ . 
ifSHl l consider such deterministic demand function. We 
also assume the knowledge of / (•). 

4. Define the following function in the operating zone, 

P{d*, d) = ^[d-g{d*, d)] ( 6 ) 

Also define two more variables. 


Pm 

Pm 


min p (d* , d) 

d*GR+,0<d<d* 

max p (d* , d) 

d*m+,0<d<d* 


(7) 


We impose the following constrains on and pm, 

- < Pm < Pm < 1 (8) 

T 

Inequality pM < 1 implies that renting is cheaper than 
buying in short run while the inequality 7 < pm implies 
that buying is cheaper in the long run. We will further 
elaborate this in Section UlI-AI 
Let 7 = p (d*, d). Then according to inequality we have 


Pm < -^ ild) < Pm 

Differentiating inequality (|^ using chain rule we get 


(9) 


Pm < 7 + <Pm ^ 

ad d 


1 ^ Pm 


( 10 ) 


dd ~ d 

According to inequality Q, the following inequality is true 
in the operating zone 


d-dd 


( 11 ) 


Inequality ( fTO) ! and o is simultaneously satisfied if 


Pm-P , dj . 

d ad 


Pm - 7 


This is because 
7 < Pm while 


Pm-l 

d 

Pm-1 


> - 


d ■ 


However 


PM-7 > 

d — 


( 12 ) 


0 for 


that there exist a ^ satisfying inequality 
7 > Pm - Also is the slope of the demand function / (•) 
with respect to 7. Hence inequality \T2 \ upper and lower bounds 
the slope of the demand function for a given d and 7. 

^One may consider that the function / (■) is the mean of this probability 
dish'ibution. 


< 0 for 7 > Pm - It is trivial to observe 
if and only if 
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III. Online Optimization Problem 


A. Ski-Rental Problem 

The ski-rental problem abstracts a class of problem in which 
a player has to decide whether to buy or rent a resource without 
a priori knowledge of the period of usage. Renting is cheaper if 
the period of usage is short while buying is cheaper in the long 
run. In the original problem a skier is faced with the option 
of either buying or renting a set of skis without knowing in 
advance the number of days she will be skiing. 

Cost of buying skis is $1 while renting cost $P per day 
where P < 1. If the skier knows in advance that she will 
be skiing for y days then the choice of buying or renting is 
simple. If p > p then the skier will buy the skis in the very 
hrst day. Otherwise she will keep renting the skis for y days. 

The online case is more challenging. In ski-rental literature, 
the concept of breakeven point is used to design online 
algorithms. Such algorithms suggest that the skier should keep 
on renting the skis till the day when the cost of renting 
nP, is more than the cost of buying, i.e. nP > 1. On the 
day she should buy the skis. These is shown to be the most 
optimal deterministic online algorithm and has a competitive 
ratio of 2. 

Ski-Rental problem has been used to solve real life problems 
like TCP acknowledgement problem ll^ . Bahncard problem 
OTll etc. Its application in cloud computing has already been 


discussed in Section I-C The key step towards using ski-rental 
literature for our problem would be to map the following four 
entities in our context; 1) Renting 2) Buying 3) Buying Cost 
4) Renting Cost. In remaining of this section we will define 
these entities. 

1. Renting: It is the process of decreasing the demand by 
increasing the selling price of the VMs. To decrease actual 
demand dt to modified demand dt we impose selling price of 
It =g{dt, df)- 

2. Buying: It is the process of buying Vt VMs to support 
the modified demand dt- If the number of active VMs in the 
beginning of time slot t is Xt then vt = max (0, dt — Xt)- 

Figure illustrates the renting and buying process. In the 
3’'^* time slot renting is equivalent to reducing the demand 
from 12 to 9 while buying is the process of purchasing 4 
VMs. Similarly in the I'**, 6*^ and the 7*^ time slot there is 
both renting and buying. There is no renting in the 2”'^ time 
slot, we only buy 2 VMs. On contrary there is no buying in 
the Sth time slot, we only rent the demand from 9 to 7. In the 
4*^ and the 5*^ time slot there is neither renting nor buying 
because the number of active VMs in the beginning of these 
time slots is more than the actual demand. 

3. Buying Cost: It is the cost of buying n VMs. As the 
cost price of VM is assumed to be 1 unit, the cost of buying 
n VMs is equal to n units. 

4. Renting Cost: It is the demand loss in a given time slot 
associated with reducing a demand from d to d — n, where 
n < d, when the actual demand is d*- Mathematically 


R {d*, d, n) = g {d*, d) ■ d — g {d*, d — n) ■ {d — n) (13) 














Time Slot 

Fig. 6. Figure showing the actual demand d* (solid black curve), modified 
demand dt (dashed red curve) and the number of active VMs xt in the 
beginning of every time slot (solid blue curve). 

It should be noted that unlike the renting cost found in other 
ski-rental literature, for our case, R is not a constant. It is a 
function of d*, d and n. However we will not mention these 
parameters explicitly for notational simplicity. Please Note: 
“Renting Cost” and “demand loss” means the same and will 
be used interchangeably. 

Now we will explain the importance of inequality constrain 
f). As mentioned before renting cost should be more than the 
buying cost in the long run. Given that the billing cycle is of 
T period, the cost of renting for r period should be greater 
than the cost of buying. Otherwise buying of VMs will never 
be required. The cost of buying n VMs is n while the cost of 
Renting of n demands for r period is Rt. Hence, Rt > n. 
Similarly renting cost should be lesser than the buying cost in 
the short run. Therefore the cost of renting n demands for 1 
period should be less than the cost of buying n VMs. Hence, 
R < n. Therefore to formulate our problem in Ski-Rental 
framework, it is necessary that the following inequality holds 

- <R<n (14) 

T 

Proposition 1: If inequality (|^ is satisfied then inequality [T4| 
will hold true. 

Proof: Consider the following, 

R = g {d*, d) ■ d — g {d*, d — n) ■ {d — n) 


= ^-[e.gid\0)]d0 


pd 


= / p{d*,0)d0 

(15) 

J d—n 


r-d 


> / pmd0 = PmU 

(16) 


d—n 


Equation (15) and inequality (16) comes from the definition 
of p {d*, x) and pm (refer equation ^ and Q). Similarly, 

(17) 


The qualitative interpretation of inequality (16) and inequality 
(17) is that the minimum and maximum renting cost of 1 
demand is pm and pM respectively. If inequality ([^ holds, 
then i < Pm- Substituting this in inequality (16) we get 

n 

-<R (18) 

T 

Similarly if inequality © holds, then pM < 1- Substituting 
this in inequality (17) we get 

R<n (19) 

Combining inequality ( |T8] l and we get inequality ( |T4l i. 
This completes the proof. 

B. Online Algorithm 

As discussed earlier in the previous subsection the concept 
of breakeven point has been widely used to design online 
algorithms in ski-rental literature. The key idea is to keep 
renting till the time renting cost equals buying cost. Here 
buying cost is the breakeven point. If renting cost exceeds 
the buying cost, we buy the resource. In this section we will 
apply these concepts to design two online algorithm; 1) Fully 
Online Algorithm which has no knowledge of future demand 
2) Partial Online Algorithm, which assumes perfect fumre 
demand information for future window w < t. Intuitively 
speaking, both Fully Online Algorithm and Partial Online 
Algorithm works in pessimistic sense. It always assumes that 
an increase in demand is not going to persist and hence it 
reduces the demand by increasing the selling price of the VMs. 
This reduction in demand incurs a renting cost. However at 
every time interval it calculates the Net Renting Cost in the 
past and the future intervals. If the Net Renting Cost exceeds 
1 unit (the buying cost of 1 VM), it buys a new VM. 

The psuedocode of partial online algorithm with future 
window of w is given in Algorithm The same psuedocode 
is applicable for the fully online algorithm if we substitute 
w = 0. Both fully/partial online algorithm consist of three 
basic steps. In the following we will explain these steps in 
relation to the fully online algorithm. 

Step 1. Calculating Net Renting Cost 

We calculate the net renting cost I which we could have 
been saved if we rented 1 less demand in each slot for the 
past T period. Fet Xi be the number of active VMs at time i. 
Then the net renting cost is 

t 

1= R{d*,x^ + l,l) ( 20 ) 

Equation is valid only if -I- 1 < d* ; Vi G 

[f — T -f 1, t]. If for a given interval Xi + 1 > d*, the renting 
cost for that interval is 0. This is done in Steps 4 to 10 of 
Algorithm [T] 

Step 2. Buying new VM 

This is done in Steps 11 to 16 of Algorithm In our case 
the cost of a VM is 1. If /> 1 then the corresponding demands 
should not have been reduced. Rather we should have bought 
a VM to serve them. To compensate for this mistake we buy 


R < pMn 






























Algorithm 1 Partial Online Algorithm with future prediction 
window w _ 

Let Xi be the number of VM’s at time i 

1. Set Xi = 0 ; i = 1, 2,..., T 

2. Predict actual demand d* for i = t, t + 1, ..., t + w. 

3. do 

4. Set Net Renting Cost I — 0. Also set i = t + w — t + 1. 

5. while {i <t + w) 

6. if (xi + 1 < d*) 

7. I = I + [g {d*, Xi + 1) ■ (xi + 1) - g {d*, Xi) ■ Xj] 

8 . end if 

9. 1 = 1 + 1 

10 . end while 

11 . if (I > 1 ) 

12. Buy a new VM: Vt = x* + 1. 

13. Update the number of VM’s that can be used in future; 

Xi = Xi + 1; i = t, t + 1,... ,t + T — 1 

14. Update the number of VM’s in the history indicating 
that previous mistakes have been corrected; 

Xi = Xi + 1; i = t + w — T + 1,... ,t — 1 

15. end if 

16. while {I > 1 ) 

17. if (xt < di) 

18. jt=g{dt,xt) 

19. else 

20. 7 = 7* 

21 . end if 

22. Jump to Step 2. 


a VM in the current time slot. We increase the current and 
the future Xi by 1 to indicate that an extra VM is available. 
We also increase the past Xi by 1 to indicate that a corrective 
measure was taken. We then jump back to Step 1. However if 
Z < 1, we jump to Step 3. 

Step 3. Setting the Selling Price 

Let the number of active VMs in the current time slot be Xt 
after performing Step 2 and Step 3. Then xt is the modified 
demand. So we set our selling price as = 9 {dt,xt). 

The partial online algorithm is almost same as the fully 
online algorithm. The difference lies in the calculation of Net 
Renting Cost in Step 1. In case of fully online algorithm it is 
calculated for the period t — r + l to t while for partial online 
algorithm it is calculated for the period t + w — r + l to t + w. 

Theorem 1: Competitive Ratio of partial online algorithm is 
c (a) = 1 + min ( 1 , pm't (1 — a)) ( 21 ) 

where a = — and w < t. 

T 

Proof: Please refer appendix for the proof. 

Corollary 1: Fully online algorithm is 2-competitive. 

Proof: For fully online algorithm a = 0. Hence c = 
1 + min {1, pmt). According to inequality (HJ we have 



Fig. 7. Graph showing the competititve ratio of partial online algorithm. 

PmT > 1- Also Pm > Pm implying pmt > 1. Therefore 
min ( 1 , pmt) = 1 and hence c = 2 . 

Note that the competive ratio of the partial online algorithm 
can be more explicitly written as 


l+PMT(l-a) ; a > 1- ^ 

As pmt > 1. there always exist an um G (0, 1) such that 

1 + Pmt {1 — a) < 2; Va > om- This gives the theoretical 
guarantee that future demand information indeed improves the 
performance of the online algorithm. Figure]^ shows a typical 
plot of equation ( | 2 l] i. 

IV. Simulation Results 

We performed simulations driven by real world traces to 
validate the online dynamic pricing algorithm proposed in the 
paper. 

The first step is to generate the actual demand curve. To 
do this we have used google cluster usage traces available in 
ll32]| . The actual demand curve is shown in Figure 7b and 7d. 
The curve spans 1 day and is slotted in 5 min interval. Cost 
price of a VM is taken to be 1 unit. A VM has a billing cycle 
of 1 hour and hence r = 12. The next step is to generate 
the demand function. A real world demand function can only 
be inferred by doing a market survey. But for the sake of 
simulation, we have synthesized our own demand function. 
This can be explained in steps: 

1. We consider a demand function of the form ^ = / (7)- 

2. A value of pm and pM satisfying inequality is chosen. 

3. A finite interval of price [ 7 * , 7 ^] is uniformly divided 
into small parts. 

4. For each part we substitute the corresponding value of 7 
and d in inequality ( [T^ . The value of ^ for this part 
is chosen in random such that it satisfies inequality ( [T^ . 
Using this value of d for the next part is calculated. 

As part of this simulation we will conduct two comparative 
studies, first to study the effect of demand prediction and 
secondly the effect of pm- 

To study the effect of demand prediction we first synthesized 
a demand function with pm = ^ and pM = 0.8. We then 










simulated Algorithm 1 for w = Q and w = 4. The results of 
the simulation is clearly shown in Figure 7b and 7c. Compared 
to ru = 4, the reduction in demand is more in w = 0. This 
shows the pessimistic nature of our online algorithm, i.e. it 
prefers reducing the demand compared to buying new VMs. 
With increase in future window w the algorithm tends towards 
the optimal counterpart. The net profit P — (ltdt — Vt) 

for ru = 0 is 781 units while for w = 4 it is 937 units. The 
net profit P is more for w = 4 and hence the net loss L (refer 
OP2) is less. This is in consensus with Theorem 1. 

We next studied the effect of on our algorithm. To do 
this we constructed another demand function with pm = ^ 
and Pm = 0.8. From Figure 7d and 7e we can observe that if 
Pm is high the effect of pricing on demand is less. This is an 
obvious consequence of the definition of pm- 

V. Discussion and Extensions 

We discussed the unique challenge posed by the presence 
of quantized billing cycles. The dynamic pricing strategy 
proposed in this paper can be considered as an alternative 
to the work done in 12 ^ to maximize the profit of the cloud 
broker. Merging our algorithm with that of ll24l should be very 
interesting. Such a merging will lead to a very interesting class 
of problems where the cloud broker has to decide whether to 
a) reduce the demand by increasing the VM price b) buy on- 
demand VMs to support the demand c) reserve VMs. This 
problem is similar to multislope ski-rental problem. 

Two deterministic online algorithms were designed to in¬ 
crease the profit of cloud brokers in the presence of QBC. 
The competitive ratio of both the algorithms were derived. We 
showed the importance of demand prediction by deriving a bet¬ 
ter competitive ratio for the partial online algorithm than those 
found in ski-rental literature. In similar lines we would like 
to explicitly point out that the competitive ratio of Algorithm 
3 of ED, i.e. deterministic algorithm with demand prediction 
window w, has a competitive ratio of l-fpr min (l , which 
is better than that reported in ll24l (refer Proposition 5). This 
result is new in ski-rental literature. 

It has been widely reported in ski-rental literature that 
randomized algorithms has better competitive ratio than its 
deterministic counterpart. Extending our algorithm to its ran¬ 
domized counterpart should be trivial. However deriving a 
better competitive ratio for the randomized algorithm with 
demand prediction may be challenging. 

The key idea of our algorithms is to use pricing signal to 
regulate user demand. One may argue that such an algorithm 
gives poor service to the user as it pushes tasks out of the 
queue in order to maximize cloud broker’s profit. We would 
like to make few comments in this regard; 

1. Those tasks which gets pushed out of the queue can enter 
it again at a later instant. So we are not rejecting the tasks. 
Rather we are deferring it. Our algorithm is specifically 
good if Pm is low. A low pM implies that even with a 
small increase in price there will be significant change 
in demand. Qualitatively, a low pM correspond to tasks 
with low priority. Such tasks can be deferred. 


2. Some literature considers penalizing the cloud provider 
when it fails to meet SLA’s (refer ||33l. Such a practice is 
not encouraged in cloud computing as it is a service ori¬ 
ented computing paradigm. If the cloud provider accepts 
the task, it must meet the SLAs. If it cannot it is better 
that it rejects the task. We are following similar practice. 

3. We can penalize cloud broker to push tasks out of the 
queue by introducing reputation factor. As discussed in 
section II-C we need to consider dynamic price-demand 


function to include reputation factor. Such an extension of 
our work will be challenging but a very fruitful research. 
As an immediate extension of our work we are interested in 


two areas. Assumption 4 discussed in Section II-C may not be 
true for all real world demand functions. We are exploring 
the implication of relaxing it. Our intuition is that if we 
make a small modification in Algorithm 7, the competitive 
ratio will remain the same. We are also actively working on 
the randomized algorithm for this problem. More specifically 
we are exploring the possibility of improving the competitive 
ratio of the randomized algorithm in the presence of statistical 
information about the actual demand. This is motivated by the 
work done in ED- 

APPENDIX 

We will denote the partial online algorithm with future 
window of w by and the offline optimal algorithm OP2 by 
OPT. We will first consider two lemmas which are important 
for the proof. 

Lemma 1: Let buy N^, VMs while OPT buy Nqpt 
VMs throughout the time duration of f = 1 to f = T. Then 
for any demand sequence d^, Nqpt > -^u,. 

Lemma 1 is an obvious consequence of the pessimistic 
nature of the fully/partial online algorithm as discussed in 
Both these algorithms assumes that a rise in 


Section III-B 


demand is not going to persist and hence has the tendency 
to reduce the demand by increasing the selling price rather 
than buying VMs to support the rise in demand. Therefore 
Nqpt > Njjj. Please refer Appendix A of for the proof. 
Lemma 2: The net renting cost of those demands that were 
served by the same VM in OPT should be greater than or 
equal to 1 (the breakeven point). 

Proof: The proof directly follows from the definition of OPT. 
We prove this lemma by contradiction. Consider that in OPT 
a VM was bought to serve demands whose net renting cost is 
less than 1. The loss suffered to buy a VM to support these 
demands is more than the loss suffered to rent these demands. 
Therefore there can be a better algorithm to reduce the loss 
which contradicts the definition of OPT. 

Let Aw and OPT rent ut and Nt demands respectively at 
time t. Let TZ (Am) and TZ (OPT) be the net renting cost of 
Am and OPT respectively. We have, 

T 

n{Aw) = Y,R{dl,dlnt) 


n{OPT) = Y,R{d:,dlNt) 
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Fig. 8. a) Demand function for two different value of pm- b), c) Plots of actual and modified demands and the corresponding pricing to compare the effect 
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Fig. 9. Figure showing the demands served by a VM in OPT. 


Let TZ {Aiu\OPT) denote the net renting cost incurred in 
which is not incurred in OPT. Mathematically 

T 

n{A^\OPT) = ^ nt) -R{d;, dl, Nt)]+ 

where (a:)^ = max(0,a;). We are interested in upper bound¬ 
ing TZ {Ayj\OPT). To do this, consider the demands that were 
rented in but not in OPT. These demands were served 
by not more than Nqpt VMs in OPT. We focus on the 
demands served by one of these Nqpt VMs. This is shown 
in the following figure 

Figurej^shows the demands served by a VM in OPT which 
is bought at time t* and last till t* + t — 1. The shaded areas 
shows the demands which are served by this VM. If the area 
is not shaded then in that time slot the VM does not serve any 
demand. The shaded areas can be scattered anywhere in the 
time interval [t*, f* + r — 1]. According to Lemma 1, the net 
renting cost of the demands depicted by these shaded areas 
should be greater than or equal to 1 . 

Now we investigate how these demands will be treated by 
Au,. While calculating the net renting cost, considers both 
future demand and past demand. This is shown in figure 
by the three arrows. A^, calculates the net renting cost in the 
interval [f-l-w — r+ 1 , t + w] where t is the current time, 
the interval [t + w — t + 1, t] corresponds to the past window 
while [f + 1, f + w] is the future window. If the net renting 
cost in the interval [f-l-w — r+ 1 , t + w] becomes 1 , then 
A,„ will buy a VM. As mentioned before the net renting cost 
in the interval [t*, t* + t — 1 ] is definitely greater than or 


equal to 1 (due to Lemma 1). Therefore there is definitely a 
time t satisfying t* <t <t* +t — w when will buy a VM. 
The demands in the interval [t*, t] will be rented while that in 
the interval [f + 1, f* + r — 1] will be served by the VM. The 
maximum renting cost of the demands in the interval [f*, t] 
is min (1, pM {t — t*))- This can be understood as follows: 

1. The maximum renting cost of one demand is pm (refer 
inequality (17)). If all the intervals in the past window 
[t* , f] has a demand, then the maximum renting cost 
in the past window is pM (t — t*). However the renting 
cost in the interval [t*, t + w] is equal to 1 and hence 
the renting cost in the interval [t*, t] cannot exceed 1. 
Therefore the maximum renting cost in the interval [f*, t] 
is min (1 , pM (t — t*)). 

2 . Due to inequality ([^, there may be a t satisfying t* < 
t < t* + T — w such that pM (t — t*) > 1. 

Therefore if A^, buys a VM at time t then the demands in 
the interval [t*, t] was rented in A^j while it is served by a 
VM in OPT. The maximum renting cost of these demands is 
min (1 , Pm (t — t*)). As mentioned before, t < t* + t — w. 
Therefore the maximum renting cost of those demands which 
are served by a VM in OPT but rented in A^, is upper 
bounded by min ( 1 , pM (t — w)). Given that at most Nqpt 
VMs serve such demands we have 

TZ{Ay,\OPT) < , Pm {t - w)) Nqpt 

= min (1, pMr(l - a)) VopT (22) 

where a = —. Let Sm = min (1, pM (t — w)). We will use 
inequality ( |22l l to upper bound TZ (Au,): 

7^(A^) < TZ{OPT)+TZ{A^\OPT) 

< TZ{OPT)+5mNopt (23) 

Let Lqpt be the net loss incurred in OPT. Then, 

Lqpt = P {OPT) -f Nqpt > Nqpt 


(24) 










































































Similarly let be the net loss incurred in A^. We have, 

= T^iAu])+Ni^ (25) 

< 7^ (OPT) + SmNopt + (26) 

< TAm{l,pMT{l-a))NopT + N^ (27) 

Inequality (26) is obtained by substituting inequality (23) in 
equation (25). We now use Lemma 2 followed by inequality 
(24) in inequality (27) to get 

La„ = SmNopt + N^ 

< SmNqpt + Nqpt (28) 

= {^ + Sm)Nopt (29) 

< {1 + Sm)Lopt (30) 

Therefore the competitive ratio of is c = 1 + Sm = 
1 + min (1, pmt (1 — «))■ This proves Theorem 1. 
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